8 research outputs found

    Study on the phonetic pitch movement of the accentual phrase in Korean read speech

    The minor prosodic unit in Korean, generally called the Accentual Phrase, is usually defined by its syntactic or phonological cues. This article analyzes the correlation between phonetic pitch movement and the accentual phrase boundary by means of pattern extraction and probabilistic prediction.

    The AANVIS project: towards the automatic multilingual analysis of non-verbal information in speech

    The aim of the project is to develop an objective and empirical methodology for analysing non-verbal information obtained from speech corpora of several languages representative of different prosodic types. This will yield fundamental knowledge about the prosodic systems of these languages that is directly applicable to the automatic multilingual processing and interpretation of spoken language.

    Correlating ASR Errors with Developmental Changes in Speech Production: A Study of 3-10-Year-Old European Portuguese Children's Speech

    Automatically recognising children's speech is a very difficult task. This difficulty can be attributed to the high variability in children's speech, both within and across speakers. The variability is due to developmental changes in children's anatomy, speech production skills and so on, and manifests itself, for example, in fundamental and formant frequencies, the frequency of disfluencies, and pronunciation quality. In this paper, we report the results of acoustic and auditory analyses of 3-10-year-old European Portuguese children's speech. Furthermore, we are able to correlate some of the pronunciation error patterns revealed by our analyses, such as the truncation of consonant clusters, with the errors made by a children's speech recogniser trained on speech collected from the same age group. Other pronunciation error patterns seem to have little or no impact on speech recognition performance. In future work, we will attempt to use our findings to improve the performance of our recogniser.

    Building OMProDat: an open multilingual prosodic database

    Current research on speech prosody generally makes use of large quantities of recorded data. In order to provide an open multilingual basis for the comparative study of speech prosody, the Laboratoire Parole et Langage has begun the creation of an open database, OMProDat, containing recordings of 40 five-sentence passages, originally taken from the European SAM project, each read by 5 male and 5 female speakers of each language. The database will contain both primary data, the recordings, and secondary data in the form of different annotation files. Currently the database contains recordings and annotations for five languages: Korean, English, French and Chinese plus a smaller subset for several languages which will be used for the TRASP workshop. All the data will be freely available on the Speech and Language Data Repository.
